• Artículo
      Icon

      A clustering approach to extract data from HTML tables 

      Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Elsevier, 2021)
      HTML tables have become pervasive on the Web. Extracting their data automatically is difficult because finding the ...
    • Artículo
      Icon

      A coral-reef approach to extract information from HTML tables 

      Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Elsevier, 2022)
      his article presents Coraline, which is a new table-understanding proposal. Its novelty lies in a coral-reef optimisation ...
    • Artículo
      Icon

      A hybrid quantum approach to leveraging data from HTML tables 

      Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Springer, 2022)
      The Web provides many data that are encoded using HTML tables. This facilitates rendering them, but obfuscates their ...
    • Ponencia
      Icon

      A Novel Approach to Web Information Extraction 

      Reina Quintero, Antonia María; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer, 2015)
      Business Intelligence requires the acquisition and aggrega tion of key pieces of knowledge from multiple sources in order ...
    • Ponencia
      Icon

      A Novel Approach to Web Information Extraction 

      Reina Quintero, Antonia María; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer International Publishing AG, 2015-06)
      Business Intelligence requires the acquisition and aggregation of key pieces of knowledge from multiple sources in order ...
    • Artículo
      Icon

      ARIEX: Automated ranking of information extractors 

      Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael; Sleiman, Hassan A. (Elsevier, 2016)
      Information extractors are used to transform the user-friendly information in a web document into structured information ...
    • Tesis Doctoral
      Icon

      Enterprise Data Integration: On Extracting Data from HTML Tables 

      Roldán Salvador, Juan Carlos (2020-12-22)
      The Web is a universal communication channel that provides a vast amount of valuable data about a plethora of topics. In ...
    • Tesis Doctoral
      Icon

      Enterprise Information Integration: New Approaches to Web Information Extraction 

      Jiménez Aguirre, Patricia (2015-10-19)
      La manera de entender la información ha cambiado radicalmente en las últimas décadas gracias a la Web, que impulsa a las ...
    • Ponencia
      Icon

      Una Experiencia para mejorar la interacción estudiante-profesor  

      Müller Cejás, Carlos; Salmerón, Inmaculada; Jiménez Aguirre, Patricia; Trinidad Martín Arroyo, Pablo (AENUI: Asociación de Enseñantes Universitarios de Informática, 2016)
      En asignaturas en las que hay proyectos o entregables evaluables, los estudiantes suelen saturar los buzones de correo ...
    • Ponencia
      Icon

      Extracting Web Information using Representation Patterns 

      Roldán Salvador, Juan Carlos; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Association for Computing Machinery (ACM), 2017)
      Feeding decision support systems with Web information typically requires sifting through an unwieldy amount of information ...
    • Ponencia
      Icon

      Feeding Software Agents with Web Information 

      Jiménez Aguirre, Patricia; Sleiman, Hassan A.; Corchuelo Gil, Rafael (Springer, 2015)
      Many software agents require information that is available in web documents. Unfortunately, the existing proposals to ...
    • Ponencia
      Icon

      Integrating Deep-Web Information Sources 

      Fernández de Viana, Iñaki; Hernández Salmerón, Inmaculada Concepción; Jiménez Aguirre, Patricia; Rivero, Carlos R.; Sleiman, Hassan A. (Springer, 2010)
      Deep-web information sources are difficult to integrate into automated business processes if they only provide a search ...
    • Ponencia
      Icon

      Mining Web Pages Using Features of Rendering HTML Elements in the Web Browser 

      Fernández, F. J.; Álvarez, José L.; Abad, Pedro J.; Jiménez Aguirre, Patricia (Springer, 2011)
      The Web is the largest repository of useful information available for human users, but it is usual that Web Pages do not ...
    • Artículo
      Icon

      On exploring data lakes by finding compact, isolated clusters 

      Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Elsevier, 2022)
      Data engineers are very interested in data lake technologies due to the incredible abun dance of datasets. They typically ...
    • Artículo
      Icon

      On Extracting Data from Tables that are Encoded using HTML 

      Roldán Salvador, Juan Carlos; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Elsevier, 2020)
      Tables are a common means to display data in human-friendly formats. Many authors have worked on proposals to extract ...
    • Ponencia
      Icon

      On Extracting Information from Semi-structured Deep Web Documents 

      Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer, 2015)
      Some software agents need information that is provided by some web sites, which is difficult if they lack a query API. ...
    • Ponencia
      Icon

      On improving FOIL Algorithm 

      Jiménez Aguirre, Patricia; Arjona, José L.; Álvarez, J.L. (CSREA Press, 2011)
      FOIL is an Inductive Logic Programming Algorithm to discover first order rules to explain the patterns involved in a ...
    • Artículo
      Icon

      On Learning Web Information Extraction Rules with TANGO 

      Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Elsevier, 2016)
      The research on Enterprise Systems Integration focuses on proposals to support business processes by re-using existing ...
    • Ponencia
      Icon

      On Member Labelling in Social Networks 

      Corchuelo Gil, Rafael; Reina Quintero, Antonia María; Jiménez Aguirre, Patricia (Springer, 2015)
      Software agents are increasingly used to search for experts, recommend resources, assess opinions, and other similar tasks ...
    • Artículo
      Icon

      On the synthesis of metadata tags for HTML files 

      Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Gallego, Fernando O.; Corchuelo Gil, Rafael (Wiley, 2020)
      RDFa, JSON-LD, Microdata, and Microformats allow to endow the data in HTML files with metadata tags that help software ...